A Review of Research in Automatic Language Identi cation
نویسنده
چکیده
1 Introduction The problem of automatic language identication|identifying the language being spoken by an unknown talker from a short excerpt of speech|is a challenging and important one, of interest to linguists and computer speech researchers. This document reviews the studies done so far in this area.
منابع مشابه
Automatic Language Identi®cation
Automatic language identi®cation of speech is the process by which the language of a digitized speech utterance is recognized by a computer. In this paper, we will describe the set of available cues for language identi®cation of speech and discuss the dierent approaches to building working systems. This overview includes a range of historical approaches , contemporary systems that have been ev...
متن کاملLanguage identi cation of web documents using discrete HMMs
Automatic language identi cation in written text documents is an issue which deserves signi cant attention in the context of the ever-growing volume of web documents. This paper deals with language identi cation in the domain of electronic texts related to tourism. The proposed system is built on Hidden Markov Models (HMMs) that enable the modeling of character sequences. For this purpose, a pa...
متن کاملSECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS
In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of non- linear system identi cation, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, se- curing low-level and high-level ...
متن کاملAutomatic Identification of European Languages
We describe our word-based implementation of a language identifying system for the text messages written in European languages. Speci cally, we use and compare linguistic (based on functional words) and statistic (based on the word frequency) approaches to construction of the identifying vocabularies. Our version of the statistic approach copes with the di erences in degrees of word overlap amo...
متن کاملAutomatic Sublanguage Identi cation for a New Text
A number of theoretical studies have been devoted to the notion of sublanguage which mainly concerns linguistic phenomena restricted by the domain or context Furthermore there are some successful NLP systems which have explicitly or implicitly addressed the sublanguage restrictions e g TAUM METEO ATR This suggests the following two objectives for future NLP research automatic linguistic knowled...
متن کامل